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Abstract 

Gategory theory provides a compact method of encoding mathematical 
structures in a uniform way, thereby enabling the use of general theorems 
on, for example, equivalence and universal constructions. In this article we 
develop the method of additional structures on the objects of a monoidal 
Kleisli category. It is proposed to consider any uniform class of information 
transformers (ITs) as a family of morphisms of a category that satisfy certain 
set of axioms. This makes it possible to study in a uniform way different 
types of ITs, e.g., statistical, multivalued, and fuzzy ITs. Proposed axioms 
dehne a category of ITs as a monoidal category that contains a subcategory 
(of deterministic ITs) with hnite products. Besides, it is shown that many 
categories of ITs can be constructed as Kleisli categories with additional 
structures. 


1 Introduction 


Currently the growing interest is attracted to various mathematical ways of 
describing uncertainty, most of them being different from the probabilistic 
one, (e.g., based on the apparatus of fuzzy sets). For adequate theoretical 
study of the corresponding “nonstochastic” systems of information trans¬ 
forming and, in particnlar, for the stndy of important notions, snch as snffi- 
ciency, informativeness, etc., we need to develop an approach general enongh 
to describe different classes of information transforming systems in a uniform 
way. 

It is convenient to consider different systems that take place in infor¬ 
mation acqniring and processing as particular cases of so-called information 
transformers (ITs). Besides, it is useful to work with families of ITs in which 
certain operations, e.g., sequential and parallel compositions are dehned. 

It was noticed fairly long ago [1-5], that the adequate algebraic structure 
for describing information transformers (initially for the stndy of statistical 
experiments) is the strnctnre of category [6-9]. 

Definition 1.1 A category is a quadruple (Ob, Horn, id, o) consisting of: 

(Cl) a class Ob of objects; 

(C2) for each ordered pair {A,B) of objects a set Hom(A, H) of mor- 
phisms; 

(C3) for each object A a morphism id/i G Hom(yl, A), the identity of A; 

(Cj) a composition law associating to each pair of morphisms f G Hom(A, B) 
and g G Hom(il, C) a morphism g o f e Hom(yl, C); 
which is such that: 

(Ml) ho [g o f) = {ho g) o f for all f G Hom(yl, B), g E Hom(il, C) and 
h E Hom(C',D); 

(M2) ids o / = / o id^ = / for all f E Hom{A, B); 

(M3) the sets Hom(74,il) are pairwise disjoint. 

This last axiom is necessary so that given a morphism we can identify its 
domain A and codomain B, however it can always be satished by replacing 
Hom(yl, B) by the set Hom(A, B) x ({A}, {5}). 

A morphism a: A ^ B is called isomorphism if there exists a morphism 
h: B ^ A snch that a o b = ij^ and b o a = i^. In this case objects A and B 
are called isomorphic. 

Morphisms a: T> A and b: V B are called isomorphic if there exists 
an isomorphism c: A ^ B such that co a = b. 



An object Z is called terminal object if for any object A there exists a 
unique morphism from A to Z, which is denoted z^: A ^ Z in what follows. 

A category D is called a subcategory of a category C if Ob(D) C Ob(C), 
Ar(D) C Ar(C), and morphism composition in D coincide with their com¬ 
position in C. 

It is said that a category has (pairwise) products if for every pair of 
objects A and B there exists their product, that is, an object A x B and a 
pair of morphisms g: AxB A and Ax B ^ B, called projections, 
such that for any object V and for any pair of morphisms a: V ^ A and 
b: V ^ B there exists a unique morphism c: V ^ A x B, satisfying the 
following conditions: 






We call such morphism c the product of morphisms a and b and denote it 
a*b. 

It is easily seen that existence of products in a category implies the fol¬ 
lowing equality: 

{a*b) o d = {a o d) * {b o d). (2) 

In a category with products, for two arbitrary morphisms a: A C and 
b: B ^ V one can dehne the morphism a x b: 

axh\ AxB-^CxV, a x b'^ {a o * [b o (3) 

This dehnition and (1) obviously imply that the morphism c = ax6 satisfy 
the following conditions: 






Moreover, c = a x b is the only morphism satisfying conditions (4). 

It is also easily seen that (2) and (3) imply the following equality: 

{a xb) o [c* d) = {a o c) * {b o d). (5) 

Suppose AxB and B x A are two products of objects A and B taken in 
different order. By the properties of products, the objects AxB and B x A 
are isomorphic and the natural isomorphism is 


Ax B ^ B X A, 


def 

^A,B - ^A,B * ^A,B- 


( 6 ) 



Moreover, for any object T> and for any morphisms a: V A and 6: P —> 
B, the morphisms a*b and b * a are isomorphic, that is, 

g o (a * 6) = b * a. (7) 

Similarly, by the properties of products, the objects (A x B) x C and 
A X (B X C) are isomorphic. Let 

^ABC- X -S) X C ^ M X (i3 X C) 
be the corresponding natural isomorphism. Its “explicit” form is: 

^ ('^A,5° '^Ax5,C^ * (^(^A,B° '^AxB,C^ * ^AxB,^ ' 

Then for any object V and for any morphisms a: V A, b'- ^ ^ and 
c: V ^ C we have 


* b) * = a * {b * c). (9) 

Examples. 

1.1. The classic example is Sets, the category with sets as objects and 
functions as morphisms, and the usual composition of functions as compo¬ 
sition. But lots of the time in mathematics one is some category or other, 
e.g.: 

Vectfc — vector spaces over a held k as objects; /c-linear maps as mor¬ 
phisms; 

Group — groups as objects, homomorphisms as morphisms; 

Top — topological spaces as objects, continuous functions as morphisms; 

Diff — smooth manifolds as objects, smooth maps as morphisms; 

Ring — rings as objects, ring homomorphisms as morphisms; 
or in physics: 

Symp — symplectic manifolds as objects, symplectomorphisms as mor¬ 
phisms; 

Poiss — Poisson manifolds as objects, Poisson maps as morphisms; 

Hilb — Hilbert spaces as objects, unitary operators as morphisms. 

1.2. The typical way to think about symmetry is with the concept of a 
’’group”. But to get a concept of symmetry that’s really up to the demands 
put on it by modern mathematics and physics, we need — at the very least — 
to work with a ’’category” of symmetries, rather than a group of symmetries. 



To see this, first ask: what is a category with one object? It is a — 
’’monoid”. The ’’usual” definition of a monoid is like this: a set M with an 
associative binary product and a unit element 1 such that al = la = a for 
all a in M. Monoids abound in mathematics; they are in a sense the most 
primitive interesting algebraic structures. 

To check that a category with one object is ’’essentially just a monoid”, 
note that if our category C has one object x, the set Hom(a;, x) of all mor- 
phisms from a; to a; is indeed a set with an associative binary product, namely 
composition, and a unit element, namely id^,. 

How about categories in which every morphism is invertible? We say a 
morphism / : a; —?/ in a category has inverse g \ y ^ x '\i f o g = id^ and 
g o f = ida;. Well, a category in which every morphism is invertible is called 
a ’’groupoid”. 

Finally, a group is a category with one object in which every morphism 
is invertible. It’s both a monoid and a groupoid! 

When we use groups in physics to describe symmetry, we think of each 
element g of the group G as a ’’process”. The element 1 corresponds to the 
’’process of doing nothing at all”. We can compose processes g and h — do 
h and then g — and get the product g o h. Crucially, every process g can be 
’’undone” using its inverse g~^. 

So: a monoid is like a group, but the ’’symmetries” no longer need be 
invertible; a category is like a monoid, but the ’’symmetries” no longer need 
to be composable. 

1.3. The operation of ’’evolving initial data from one spacelike slice to 
another” is a good example of a ’’partially defined” process: it only applies 
to initial data on that particular spacelike slice. So dynamics in special or 
general relativity is most naturally described using groupoids. Only after 
pretending that all the spacelike slices are the same can we pretend we are 
using a group. It is very common to pretend that groupoids are groups, 
since groups are more familiar, but often insight is lost in the process. Also, 
one can only pretend a groupoid is a group if all its objects are isomorphic. 
Groupoids really are more general. 

In the work [10] we undertake an attempt to formulate the method of 
categorical extension of the theory of a group G as follows: 

Let G be a group. Then G is merely the visible part of a certain cat¬ 
egory K which is invisible to the naked eye. More precisely, there exists 
a certain category K (the train of the group G) such that the group itself 
is the automorphism group of a certain object V, while the semigroup T 



is the semigroup of endomorphisms of this same object. Furthermore, each 
representation p of G' on a space H can be extended to a representation of 
the category K. In other words, for each objects W of the category K we 
can construct a linear space T(W) and for each morphism P : W ^ W we 
can construct a linear operator r(P) : T(W) T(W') such that for any 
morphisms P : W ^ W and Q : W W we have 

t{QP) = t{Q)t{P) 

with T{y) = P, and for all G G the operators T{g) and p{g) are the 
same. 

We note that all the spaces T{W) and all the operators r(p) “grow out 
of” the one and only representation p of G and the one and only space H. 

So: in contrast to a set, which consists of a static collection of ’’things”, 
a category consists not only of objects or ’’things” but also morphisms which 
can viewed as ’’processes” transforming one thing into another. Similarly, 
in a 2-category, the 2-morphisms can be regarded as ’’processes between 
processes”, and so on. The eventual goal of basing mathematics upon omega- 
categories is thus to allow us the freedom to think of any process as the sort 
of thing higher-level processes can go between. By the way, it should also be 
very interesting to consider ”Z-categories” (where Z denotes the integers), 
having j-morphisms not only for j = 0,1, 2,... but also for negative j. Then 
we may also think of any thing as a kind of process. 

Definition 1.2 Let X and Y he two eategories. A funetor from X to Y 
is a family of funetions F whieh assoeiates to eaeh object A in 'K. an ob¬ 
ject FA in Y and to each morphism f G Homx(^,P) a morphism Ff G 
}iomY{FA,FB), and which is such that: 

(FI) F{g of) = FgoFf for all f G Homx(kl, B) and g G HomY(P, G); 

(F2) FidA = id^A for all A G Ob(X). 

There is the dehnition of left and right adjoint functors. In the following 
we shall need two such adjoint constructions. First, in a given category the 
left adjoint of the diagonal functor (if it exists) is called the coproduct and 
the right adjoint (if it exists) is called the product: in Sets the product is 
the Cartesian product and the coproduct is the disjoint union. Second, let 
the category X be concrete over some category A in the sense that there 
exists a faithful functor F from X to A, usually called the forgetful functor. 
The left adjoint to this functor (if it exists) is then called the free functor. 



A standard example is the forgetful functor from complete metric spaces to 
metric spaces, whose left adjoint in the completion functor. On the next 
higher level of abstraction the notion of a natural transformation is settled. 
It is a kind of a function between functors and is dehned as follows. 

Definition 1.3 Let F ; X — Y and G :'K be two functors. A natural 
transformation a : F ^ G is given by the following data. 

For every object A in IK there is a morphism aA '■ F{A) — G{A) in Y 
such that for every morphism f : A ^ B in K the following diagram is 
commutative 

F{A) ^ G{A) 

F{f) i i G{f) 

F{B) ^ G{B). 

Commutativity means (in terms of equations) that the following compo¬ 
sitions of morphisms are equal: G{f) o ua = o:b ° F{f). 

The morphisms aA, A E Obj(A), are called the components of the natural 
transformation a. 

Examples. 

1.4. So, we can certainly speak, as before, of the ’’equality” of categories. 
We can also speak of the ’’isomorphism” of categories: an isomorphism be¬ 
tween C and D is a functor F : C —D for which there is an inverse functor 
G : D —> C. he., FG is the identity functor on C and GF the identity on D, 
where we dehne the composition of functors in the obvious way. But because 
we also have natural transformations, we can also dehne a subtler notion, the 
’’equivalence” of categories. An equivalence is a functor F : C —> D together 
with a functor G : D —C and natural isomorphisms a : FG Iq and 
b : GF — > Ib- a ’’natural isomorphism” is a natural transformation which 
has an inverse. 

1.5. As we can ’’relax” the notion of equality to the notion of isomorphism 
when we pass from sets to categories, we can relax the condition that FG 
and GF equal identity functors to the condition that they be isomorphic to 
identity functor when we pass from categories to the 2-category Cat. We 
need to have the natural transformations to be able to speak of functors 
being isomorphic, just as we needed functions to be able to speak of sets 
being isomorphic. In fact, with each extra level in the theory of n-categories, 
we will be able to come up with a still more rehned notion of ”n-equivalence” 
in this way. 



Analysis of general properties for the classes of linear, multivalued, and 
fuzzy information transformers, studied in [5, 11-18], allowed to extract gen¬ 
eral features shared by all these classes. Namely, each of these classes can be 
considered as a family of morphisms in an appropriate category, where the 
composition of information transformers corresponds to their “consecutive 
application.” Each category of ITs (or IT-category) contains a subcategory 
(of so called, deterministic ITs) that has products. Moreover, the operation 
of morphism product is extended in a “coherent way” to the whole category 
of ITs. 

The works [19-22] undertook an attempt to formulate the method of ad¬ 
ditional structures as a set of “elementary” axioms for a category, which 
would be sufficient for an abstract expression of the basic concepts of the 
theory of information transformers and for study of informativeness, deci¬ 
sion problems, etc. This paper proposes another, signihcantly more compact 
axiomatic for a category of ITs. According to the method of additional struc¬ 
tures on the objects of a category of ITs it is dehned in effect as a monoidal 
category [6, 8], containing a subcategory (of deterministic ITs) with hnite 
products. 

Among the basic concepts connected to information transformers there 
is one that plays an important role in the uniform construction of a wide 
spectrum of IT-categories — the concept of distribution. Indeed, fairly often 
an IT a: M can be represented by a mapping from A to the “space of 
distributions” onB (see, e.g., [11-18]). For example, a probabilistic transition 
distribution (an IT in the category of stochastic ITs) can be represented by a 
certain measurable mapping from A to the space of distributions on B. This 
observation suggests to construct a category of ITs as a Kleisli category [6,23], 
arising from the following components: an obvious category of deterministic 
ITs; a functor that takes an object A to the object of “distributions” on M; 
and a natural transformation of functors, describing an “independent product 
of distributions”. 

It appears that rather general axiomatic theory, obtained this way, makes 
it possible to express in terms of IT-categories basic concepts for information 
transformers and to derive their main properties. 

Of cause, the most developed theory of uncertainty is probability the¬ 
ory (and statistics, based on probability). Certainly, mathematical statistics 
accumulated a rich conceptual experience. It introduced and deeply inves¬ 
tigated such notions as joint and conditional distributions, independence, 
sufficiency, and others. 



At the same time, it appears that all these concepts have very abstract 
meaning and hence, they can be treated in terms of alternative (i.e., not 
probabilistic) approaches to the description of uncertainty. In fact, the basic 
notions of probability theory and statistics, as well as the methodology and 
results, are easily extended to other theories dealing with uncertainty. In [11- 
18] it is shown that a rather substantive decision theory may be constructed 
even on the very moderate basis of multivalued or fuzzy maps. 

The approach developed in this paper allows to express easily in terms 
of IT-categories such concepts as distribution, joint and conditional distribu¬ 
tions, independence, and others. It is shown that on the basis of these con¬ 
cepts it is possible to formulate fairly general statement of decision-making 
problem with a prior information, which generalizes the Bayesian approach 
in the theory of statistical decisions. Moreover, the Bayesian principle, de¬ 
rived below, like its statistical prototype [24], reduces the problem of optimal 
decision strategy construction to a signihcantly simpler problem of hnding 
optimal decision for a posterior distribution. 

Among the most important concepts in categories of ITs is the concept 
of (relative) informativeness of information transformers. There are two dif¬ 
ferent approaches to the concept of informativeness. 

One of these approaches is based on analyzing the “relative positions” 
of information transformers in the corresponding mathematical structure. 
Roughly speaking, one information transformer is regarded as more informa¬ 
tive than another one if with the aid of an additional information transformer 
the former one can be “transformed” to an IT, which is similar to (or more 
“accurate” than) the latter one. In fact, this means that all the informa¬ 
tion that can be obtained from the latter information transformer can be 
extracted from the former one as well. 

The other approach to informativeness is based on treating information 
transformers as data sources for decision-making problems. Here, one infor¬ 
mation transformer is said to be semantically more informative than another 
if it provides better quality of decision making. Obviously, the notion of se¬ 
mantical informativeness depends on the class of decision-making problems 
under consideration. 

In the classical researches of Blackwell [25, 26] the correspondence be¬ 
tween informativeness (Blackwell sufficiency) and semantical informativeness 
(Blackwell informativeness) were investigated in a statistical context. These 
studies were extended by Morse, Sacsteder, and Chentsov [1-4] who applied 
the category theory techniques to their studies of statistical systems. 



It is interesting, that under very general conditions the relations of infor¬ 
mativeness and semantical informativeness (with respect to a certain class of 
decision-making problems) coincide. Moreover, in some categories of ITs it 
is possible to point out one special decision problem, such that the resulting 
semantical informativeness coincides with informativeness. 

Analysis of classes of equivalent (with respect to informativeness) infor¬ 
mation transformers shows that they form a partially ordered Abelian monoid 
with the smallest (also neutral) and the largest elements. 

One of the objectives of this paper is to show that the basic constructions 
and propositions of probability theory and statistics playing the fundamental 
role in decision-making problems have meaningful counterparts in terms of 
IT-categories. Furthermore, some definitions and propositions (for example, 
the notion of conditional distribution and the Bayesian principle) in terms 
of IT-categories often have more transparent meanings. This provides an 
opportunity to look at the well known results from a different angle. What 
is even more signihcant, it makes it possible to apply the methodology of 
statistical decision-making in an alternative (not probabilistic) context. 

Approaches, proposed in this work may provide a background for con¬ 
struction and study of new classes of ITs, in particular, dynamical nonde- 
terministic ITs, which may provide an adequate description for information 
flows and information interactions evolving in time. Besides, a uniform ap¬ 
proach to problems of information transformations may be useful for better 
understanding of information processes that take place in complex artihcial 
and natural systems. 

2 The method of additional structures on the 
objects of a category 

2.1 Basic definitions 

To use the categorical language more effectively we introduce general concept 
of an additional structure on objects of a category. This is the concept of 
concrete category but over any category [19-22]. 

In a category, two objects x and y can be equal or not equal, but they can 
be isomorphic or not, and if they are isomorphic, they can be isomorphic in 
many different ways. An isomorphism between x and y is simply a morphism 
f X ^ y which has an inverse g : y ^ x, such that fog = id^ and gof = id^,. 



In the category Sets an isomorphism is just a one-to-one and onto func¬ 
tion, i.e. a bijection. If we know two sets x and y are isomorphic we know 
that they are ’’the same in a way”, even if they are not equal. But specifying 
an isomorphism f : x ^ y does more than say x and y are the same in a 
way; it specihes a particular way to regard x and y as the same. 

In short, while equality is a yes-or-no matter, a mere property^ an isomor¬ 
phism is a structure. It is quite typical, as we climb the categorical latter 
(here from elements of a set to objects of a category) for properties to be 
reinterpreted as structures. 

Definition 2.1 We tell that a functor F : C ^ C define a additional 
C—structure on objects of the category C if 

1. VX,F G ObiC) the map F : C{X,Y) C'{F{X), F{¥)) ts injective, 

2. 'iX G Ob{C),Y G Ob{C') and an isomorphism u : Y ^ F{X) there is 
an object Y E Ob and an isomorphism u : Y ^ X such that F{Y) = Y 
and F{u) = u. 

Such functor is called a forgetful functor. 

Almost all usual mathematical structures are structure on sets in this 
sense and there are corresponding forgetful functors to the category Sets of 
sets. 

A forgetful functor F : C ^ M{C') dehnes a C-structure on morphisms of 
the category C. 

For our general structures we can dehne usual construction: 

- inverse and direct images of structures; 

- restrictions on subobjects, 

- different products of structures. 

We can dehne the category Str{C) of forgetful functors to the category 
C. It is a full subcategory of the category Cat/C of all categories over C. 

Some properties of structures (= forgetful functors): 

- In the category Str{C) the (bundle) product always exists. It gives a 
“union” structures. 



~ Any functor f : C ^ C' transfers structures to inverse direction, i.e. it 
defines the functor 

f* : Str{C') Str{C) : F ^ f*F. 

- For a forgetful functor F : C ^ C the functors 

(Fo) = Funct{id, F) : Funct{B,C) Funct{B,C') 

(oF) = Funct{F,id) : Funct{C',B) Funct{C,B) 

are forgetful functors. 

~ One of constructions which transfers structure F : C ^ Sets defined 
on sets to objects of any category B, is the functor 

h B ^ Funct{B°, Sets) : B i—> hs- 

Thus we have 

h%C —^ C 
i IF 

B' ^ Sets 

- If a functor A \ B ^ C is injective on morphisms (the condition (1) in 
the dehnition of forgetful functor) then a forgetful functor F : B' ^ C 
and an equivalence i : B ^ B' exist, such that the following diagram is 
commutative 

B —^ C 

i /Fp 

B' 

2.2 Structures on Topological Spaces 

Among of structures on topological spaces we can select that, which is com¬ 
patible with the topology. Let Top be a category of some topological spaces 
with a forgetful functor F : Top ^ Sets. 

The categories associated with a topological space T G Ob(Top) as fol¬ 
lows: 

- The category T(T), where Oh(T(T)) is the set of all open subsets of 
T, and Mor{T{T)) is all their inclusions. 



~ The category (pseudogroup) V{T), where Oh(V(T)) is the set of all 
open subsets of T, and Mor{V{T)) is all their homeomorphisms. 

Functors T(T)° Set are called presheaves of sets on T. Some of them 
are called sheaves. Thus we have the inclusions 

Sh{T) C PreshiT) C Funct{T{T), Sets). 

A Grothendieck topology on a category is dehned by saying which families 
of maps into an object constitute a covering of the object and certain axioms 
are fulhll. A category together with a Grothendieck topology on it is called 
a site. For a site C one dehne the full subcategory Sh{C) C PreshiC) = 
Funct{C°, Set). The objects of Funct{C°, Set) are called presheaves on the 
site C, and the objects of Sh{C) are called sheaves on C. 

For any category there exists the hnest topology such that the all rep¬ 
resentable presheaves are sheaves. It is called the canonical Grothendieck 
topology. Topos is a category which is equivalent to the category of sheaves 
for the canonical topology on them. 

Hence, the topology is already transfered on a category so now it is natural 
to consider on language of toposes and sheaves all questions connected to 
local properties. 

Here we shall not consider local structures on toposes in general, and we 
shall restrict ourselves with the consideration of the elementary case of the 
category Top. 

Definition 2.2 A structure defined by a forgetful functor f -. C ^ Top is 
called a local structure if 

VC G ObjiC) and any inclusion map i : U ^ f{C) of the open subset U 
an object U G Ob{C) and a morphism i G C{U,C) exist such that f{U) = U 
f{i) = i. This C—structure U is denoted by C\U and called a restriction of 
C on U. 

In other words we can restrict ourselves with local structures on open 
subsets. 

For a local structure F : C ^ Top and each object X G Obj(Top) there 
is the presheaf of categories 

T(X)° ^Cat-.Ue^ F-\Ufidu). 


Often this presheaf is a sheaf. 



2.3 Structures on Smooth Manifolds 

Let A4 be the category of smooth (cx)-differentiable) manifolds with forgetful 
functor f : Ai ^ Top, which dehnes a local structure and the presheaves of 
these structures are sheaves. On the category M. there is the tangent functor 
T :M^M: M^T{M). 

Its iterations give us almost all interesting functors on At. Among them 
we shall note the following: 

- The cotangent functor T* : Ai ^ Ai : M T*{M). 

- For a manifold M and natural number k = 0,1,... the functor of A;—jets 
jk . Ai^Ai: N ^ J^{M, N). 

- For a manifold M, x G M, and natural number A; = 0,1, • • • the functor 
of A;—jets at the point x : Ai ^ Ai J^{M, N). 

Any category C of structures on smooth manifolds (or on At/) has an 
additional structure, which give us a possibility to define ’’smooth families of 
morphisms”. 

Definition 2.3 Let M,M',M'' e Ai. A map 

$ : M ^ Ai{M', M") : a; ^ 

is ealled a smooth family of morphisms if there exists a smooth map (f : 
M X M' M" such that 

Vt e M, x' E M' = (p{x,x'). 

Thus we get the class of categories with smooth families and it appears 
the natural condition on functors. 

Definition 2.4 A functor is called a smooth functor if it maps each smooth 
family to a smooth family. 


Of course all functors T,T*, J^, are smooth. 



2.4 Double Categories as additional structure 
on categories 

In any category C with bundle products for some morphisms we can define so- 
called intern categories. This is a monoid in the multiplicative category C//0 
of pairs of (special) morphisms D, R : M ^ O with the bundle product: 

for ^ = {D,R : M ^ O) and = {O', R' : M ^ O) we get ^ ^ = 

(D o tti, i?' o 7r2 : M Xq M' O) where the unit objects id^ : 0 —M and 
idM' : 0 — M' and the following diagram is commutative 


M XqM' M' 


TTl 


R' . 


M —^ O 


So an intern category is an object ^ = {D, R : M— > O) with a multipli¬ 
cation ^ ^ ^ and the unit idu ■ O M. 

Now we consider such intern category as the category Cat of categories 
and will call it as double categories [20]. 


Definition 2.5 A double category D consists of the following: 

(1) A category Dq of objects Obj (Do) and morphisms Mor {Dq) of 0-level. 

(2) A category DI of morphisms Obj{Di) of 1-level and morphisms Mor {Di) 
of 2-level. 

(3) Two functors d,r : Di^Dq. 

(4) A composition functor 


Di X Di ^ Di 

where the bundle product is defined by commutative diagram 

Di Xdq Di 7^ Di 

Til i id 

Di r^ Do 

(5) A unit functor ID : Dq ^ Di, which is a section of d,r. 

There are strong and weak double categories. 

Now we see that for two objects A,B E Obj{Do) there are 0-level mor¬ 
phisms Do{A,B) which we note by ordinary arrows f : A ^ B, and 1-level 



morphisms B), which we note by the arrows ^ : A ^ B for A = d{^) 

and B = r{^). So with a 2-level morphism a : ^ ^ where ^ : A ^ B and 
: A' ^ B' we can associate the following diagram 


A 

d{a) [ 
A' 


i 




B 

i r{a) 
B' 


i a 


and arrow a : d{a) ^ r{a). 

On each level we have the corresponding compositions: 


0-level 

1- level 

2- level 


{A^B ^C) 

^ a l3 

(A^B^C) 

a f} 

if ^ 9^ h) 


9° f■A- 

(3oa-.i 
7]*^: Ai 
P*a: f 


C 
► ? 

c 

■ h 


The composition on 2-level associated with the diagram 


A 


B 


d{a) i 


i r{a) 

i a 

A' 


B' ^ 

- i' 

d{a') i 


[ r{a') 

i a' 

A!' 


B” 



Thus, a double category D consists of 


• fom sets Obj{Do), Mor{Do),Obj{Di), Mor{Di), and eight maps of type 
d, r 

Obj{Di) 1= Mor{Di) 

Obj{Do) ^ Moi{D,) 

• two categories are associated Dq, Di, and almost categories: D(^ 2 ) 
with the set of objects Obj{Do) and the set of morphisms Obj{Di), D(^s) 
with the set of objects Mor{Do) and the set of morphisms Mor{Di), 

• r,d : Zi)( 3 ) —> D(^ 2 ) are almost functors. 



Now we can define for double categories double (category) functors 
and their morphisms, double subcategories , the category DCat of dou¬ 
ble categories, equivalence of double categories, dual double categories 
(changed direction of 1-level morphisms, i.e. d,r are transposed), and so on. 

Definition 2.6 A double category functor F : D D' is a pair Fq : Dq ^ 
Dq, Fi : Di ^ D[ of usual functors such that 

d' o Fi = Fq o d, r' o Fi = Fq o r, 

ve,e' e Obj{D,) : Fi(e*O^Fi(0 *'Fi(0, 

V A e Obj{Do) : F^{IDA)'^IDp^^Ay 


2.5 Examples of Double Categories 

Examples considered bellow show that double categories are sufficiently nat¬ 
ural for mathematics. 


Example 2.1 Bicategories are the partial case of double category D when 
the category Dq is trivial, i. e. has only identical morphisms and composition 
of 1-level and 2-level morphisms are associative. 


Example 2.2 For each category C we have the canonical double category 
Morph{C) of morphisms. Let C be a category, T be the diagram • ^ TC 
be the category of diagrams in C of type T, let Dq = C and Di = TC. The 
functor d maps the diagram f : A ^ B into the object A, the functor r maps 
this diagram into the object B, and so on. It is easy to see that we get a 
double category D which is noted by Morph{C). Here Obj{Di) = Mori^Do) , 
a 2-level morphism f ^ g is a pair {u,v) of morphisms u,v E Mor{C) with 
usual composition from the commutative diagram 


A 


A' 


fi 

B 


V 


if 

B' 


Example 2.3 Let C be a category with bundle products, i.e. for all mor¬ 
phisms u,v to Y the universal sguare 


XxzY 

i 

X 


u 


Y 
i V 
Z 



exists. And let T be the following diagram 


TC be the category of diagrams in C of type T. Now we define the double 
category D with Dq = D and Di = TC. Two functors 

d,r:TC^C, 


where the functor d maps the diagram A ^ M ^ B into the object A, the 
functor r maps this diagram into the object B. The composition: for two 

1-level morphisms ^ = {A M ^ B) : A ^ B and f'={B^ M'' ^ C) : 

B ^ C we define their composition f'o^=(^A M Xb M' C) where 
the bundle product is defined by the universal diagram 


M Xb M' 712 AT 

TTi i i tt' 

M ^ B 


A 2-level morphism is a triple a = {u,v,w) : f f' from the following 
commutative diagram 


M 


B 

TT i 

\ V 


A 


M' 


\ U 

tt' i 



A' 



with the evident composition. 

Example 2.4 Let us consider a multiplicative (tensor) category (C, ^,U,u). 
Then we have the double category with Di = C, and Dq = (*, *), e.c. a trivial 
category with one object and one morphism. The composition is 



Let us consider it in more details. Let (C, U, u) he a multiplicative (tensor) 
category with multiplication 

® ■. C X C ^ C ■. {X,Y) ^ X 

for the functor isomorphism of associativity 

(f : ® o (id, ®) ^ ® o ((g), id) 


we write 

^X,Y,Z ■. X ® (Y ® Z) ^ (X ®Y) ® Z 
so the pentagon is commutative 


X® (F® ((X ® F) (g F) (g TF 

idx <g pyyy/ i H^xyy ® idw T 

X g ((F g F) g W) rx.Y^.w (X g (F g F)) g W 

Then we have the double category D with Dq = C and Di such that 

ObjiDi) = {{X,x)\A,B,X e Obj{C), x : X ® B}. 

So, we write f = (X,x) : A ^ B and for f G Ohj(Di) we denote f = 

(X^,x^), d(f) = A^, r(f) = B^. 2-level morphisms 


X®A 

-Di(^,^0 = {(/i;/ 2 ,/ 3 ) I commutative diagram /a ®/i i 

X' ® A' 



B 


i/2 } 

B' 


and d(fi, /a, /s) = /i, r(/i, /a, /a) = /a- 

Composition Di Di ^ Di is defined as follows 

i e 

for A ^ B ^ B' f o = (A,B',X'X,x”), where x" is the following 


composition 

(X' ®X)®A X' g (X g A) X'®B ^ B'. 

Associativity. For A ^ B ^ B ^ B the left column gives x^"o{^'o^), the 
right column gives X(^//o^')o^ 



Vx",X'®X,A i 
X"® ((X'®X) ® A) 
idx" ® i 

X" ® (X' ®{X® A)) 
idx" ® (idx' ® a;) I 

X" ® (X' ® B) 
idx" ® a;' I 

X" ® B" 

x" [ 

B" 


{{X" ® X') ® X) ® A 
VX"®X’,X,A i 
(X" ® X') ®{X®A) 
idx"®x' ® a; I 

(X" ®X')®B 

‘^X",X',B i 

X" ® (X' ® B) 
idx" ® a;' I 

X" ® B' 
x” i 
B” 


So we have isomorphism 

i^x",x',x, idA', ids') ■ i" o {i' o 0 ^ ° ■CO ° i- 

2.5.1 Bundle of Categories 

Let 99 : X —C be a functor and for all objects U G ObjC and we denote by 
Xu = (p~^{U,idu) the subcategory of F with 

Obj Xu = {u e ObjX I (p{u) = U} , 

Mor Xu = {/ G Mor X \ (p{f) = idu} ■ 

Let (/ : n —> m) G MorX, ip{f : v ^ u) = {g : V U). Then one tells that 
/ is Deseartes’s morphism , or that v is inverse image g*{u) of the object u, 
if Vn' G Obj{Xv) the map 


/* ; Xv{v', v) Xg{v', u) -.h^ f oh 


is a bijection. Here we have 

Xg{v,u) {h G X{v,u) I ip{h) = g.) 



So we have the diagram 


V v' 

i" 

V 

V 

A functor P : JF —C is called a bundle of categories if inverse images allows 
exist and a composition two Descartes morphism is Descartes morphism too. 
Then g* may be transfered to functor J^{U) —> ^(D), and {gi o ^ 2 )* will be 
canonical isomorphic to g^ ° gl- 

Example 2.5 The projection 

El : Mor{Top) ^ Top : {f : X ^ Y) ^ X 

is a bundle of categories. For different structures on topological spaces it is 
not always truth for the category of all morphisms, but may be truth for a 
subcategory. 

Example 2.6 Let Sub be a subcategory in Mor-{Man) consists from sub¬ 
mersions. Then projection 

U 2 : Sub Man -. {f -. X ^ Y) ^ Y 

is a bundle of categories and for each morphism h G Man{B', B) we have 
the functor 0 /inverse image.- 

h* : Subs ^ Subs' -. {f -. M ^ B) ^ {B' Xb M ^ B'). 

The set r(7r) of sections of an submersion tt : M B is the set of morphisms 

Suh{idB, 7i). 

Example 2.7 Let Mod be the category of pairs {R,M) where R is a ring 
and M is a left R-module. Let Rings be the category of rings. Then the 
functor 

Mod —s- Rings : {R, M) 1 —>• R 

is a bundle of categories and for each morphism h G Ring(P', R) we have 
the functor 0 /inverse image.- 

h* : P-mod —P'-mod : M R' M. 


foh 


u 


U 



2.6 Fibers of Functor Morphisms 

The Grothendieck’s definition of a fiber of a functor morphism is applicable 
to morphisms of functors from any category to the category Sets of sets. 
Let F,G : C ^ Set, and ip : F ^ G he their morphism. For each object 
S G ObjiC) and an element a G G{S) the fiber pa of p over a is the following 
functor 

Pa : c/s Sets : f Pa{f), 
where for a morphism f : T ^ S 

Pa{f) = {fieF{T)\G{f)opr{fi) = a}. 

So we have the following diagram 

p^if) C F{T) F{S) 

i [ PS 

G(T) ^ G(^) 3 a. 

3 Multiplicative structures on categories 

3.1 Concepts and state of the art 

The prototype of a category is the category Sets of sets and functions. The 
prototype of a 2-category is the category Cat of small categories and functors. 
Cat has more structure on it then a simple category because we have natural 
transformations between functors. This can be viewed in the following way: 
The extra structure implies that every morphism set Hom(C, D) in Cat is 
actually not only a set but a category itself where composition and identities 
in Cat are compatible with this categorical structure on the Hom-sets (i.e. 
composition and identities are functorial with respect to the structure on the 
Hom-sets). A general category with this kind of extra structure is called a 
2 -category. 

The definition of a 2-category can be put in a more general setting (which 
will be convenient below) by using the language of enriched categories. A 
category C is enriched over a category V if every Horn-set in C has the struc¬ 
ture of an object in V and if composition and identities in C are compatible 
with this extra structure on the Hom-sets. So, a 2-category is a category en¬ 
riched over Cat. Now, the (small) 2-categories again form a category 2-Cat 



and a 3-category can be defined as a category enriched over 2-Cat (indeed, 
2-Cat turns out to be a 3-category itself). In this way we can proceed it¬ 
eratively to define n-categories and then cu-categories as categories involving 
n-categorical structures of all levels. 

A concrete recipe obtaining of monoidal (braided etc) 2-categories via 
Hopf categories is proposed by Crane and Frenkel [27]. Namely, that it 
is supposed the 2-category of module-categories over a Hopf category now 
plays an important role in 4-dimensional topology and TQFT. Although the 
theory of Hopf categories is devised, in general, by Neuchl [28], interesting 
examples are still missing. In particular the Hopf category, underlying the 
Lusztig’s canonical basis [29] of a quantized universal enveloping algebra, is 
not constructed yet. We propose to define it as a family of abelian categories 
of perverse /-adic sheaves equipped with some functors of multiplication and 
comultiplication [30]. These perverse sheaves are equivariant in the sense of 
Bernstein and hunts [31]. 

It turns out that the notions of n-category and cu-category are not general 
enough for several interesting applications. What one gets there are weak 
versions of these concepts (instead of weak n-category sometimes the notions 
bicategory, tricategory, etc. are used). Let us shortly explain what this 
means; In a category it does not make sense to ask for equality of objects 
but the appropriate notion is isomorphism. In the same way, in a 2-category 
we should not ask for equality of morphisms but only for equality up to an 
invertible 2-morphism (the morphisms between the morphisms, e. g. the 
natural transformations in Cat). Applying this to the categorical structure 
itself (i.e. requiring associativity and identity properties only up to natural 
equivalence) leads to the notion of weak 2-category (or bicategory). In the 
same way, we can weaken the structure of an n-category up to the (n— l)-th 
level to obtain a weak n-category. 

The point making this weakening an involved matter is that in general 
we need so called coherence conditions in addition to the weakened laws in 
order to assure that some properties, known from the strict case, hold. E.g., 
to assure that associativity is iteratively applicable (i.e. that we can up to a 
2 -isomorphism rebracket composites involving more than three factors), we 
need a coherence condition stating that even four factors can be rebracketed 
(and the other cases follow then). See the literature given above for the 
details. 

A satisfactory version of a weak n-category for higher n and of a weak 
cu-category was not available for a long time but now there are several ap- 



proaches at hand [32-34], The relationship between these approaches and a 
universal understanding of these structures has still to be achieved. 


3.2 Multiplicative Categories 

Definition 3.1 A multiplication in the category C is an associative func¬ 
tor 

*:CxC^C:iX,Y)^X*Y. 

An associativity morphism for * is a functor isomorphism 

(PX,YY ■ ^ * 0^ * ^ ^ 

such that for any four objects X, Y, Z, T the following diagram is commuta¬ 
tive: 


X *{Y *T)) {X *Y)*{Z *T) X *Y * Z *T 


i idx * ^Y,Z,T 

X*{{Y*Z)*T) 


T ^X,YZ * 

{X*{Y*Z))*T 


X*{Y*{Z*T)) (X*F)>k(Z*T) X*Y*Z*T 


idx*^Y,Z,T 


‘rx,Y,Z*^d.T 


X*{{Y*Z)*T) rx,Y*z,T^ {X*{Y*Z))*T 

An commutativity morphism for * is a functor isomorphism 

f)x,Y ■. X *Y ^Y *X 

such that for any two objects X, Y we have 

^x,Y ° ‘^Y,x = idx*Y ■ X *Y X *Y. 

Morphisms associativity (p and commutativity ip are compatible if for any 
three objects X, X, Z the following diagram is commutative: 


X>k(X*Z) (X*X)*Z Z>k(X*X) 

i idx * i’Y,z T ^Z,X,Y 

X*{Z*Y) {X*Z)*Y {Z*X)*Y 



X*{Y*Z) *Y) * Z Z*{X*Y) 

i idx * 'ipY,z T ^Z,X,Y 

X*{Z*Y) {X*Z)* {Z*X)*Y 

A pair {U, u) where U G Obj{C) and an isomorphism u : U U *U is ealled 
a unit object for C, * if the funetor 

X ^U*X :C^C 

is equivalence of categories. 

Definition 3.2 A multiplicative category is a collection {C, *, (p, fj, U, u). 

If there are some additional structures on category, then it is usually 
assumed that product * and others elements of the collection are compatible 
with these structures. 

3.3 C-monoids or multiplicative objects. 

Monoidal categories and Monoids. Comonoids 

Let C = {C,*,(p,ip,U,u) be a multiplicative category. An multiplicative 
object in C or C-monoid is an object M G Obj{C) with multiplication 
p, ■. M * M ^ M ■. [m, m') i—> p{m, m') and an unit e : U ^ M such that the 
following axioms are faithful; 

(1) Associativity: the following diagram is commutative 

I idM * h i h * idlM 

M*M ^ M ^ M*M 

(2) Unit: the following diagram is commutative 

M —^ U*M M*U 

II ie*idM i idM * £ 

M M*M = M*M 



Example 3.1 Let R be a commutative ring. The category i?-mod of R - 
modules is a multiplicative category under the tensor product with the 
unit object is the left R-module R. Multiplicative objects in the category is 
R—algebras with units. 

Example 3.2 A small multiplicative category C is a multiplicative object of 
the multiplicative category Sets//Ohj{C). 

Multiplicative structures may be described in categories as monoids in a 
monoidal category. 

A monoidal category (C, K,ip, .. .) consists of: 

® : C X C ^ C, K G OhC ~ the unit object, 
and the functor-isomorphisms: 

Ta,b,c : (A 0 5) ® ^ a 0 (5 0 C) 

f)A'. A® K ^ A, ..., 

where ® is symmetrical, if there exists a functor-isomorphism 

6a,b : A®B ^ B®A. 

A monoid in a monoidal category (C,®, K,ip,.. .) is an object M endowed 
a multiplication 

fj,: M®M ^ M 

and the unit morphism e : K ^ M + Axioms. 

A comonoid is a monoid in {C°^, ®, K,cp,. ..). In C we have the comulti¬ 
plication 

A: M ^ M®M 

the counit p -. M ^ K + Axioms. 

An action of a monoid M on A is dehned by 

a: M ® A^ A 


+ Axioms. 

A monoidal functor (a morphism of monoidal categories) of two monoidal 
categories is dehned by F : (C, K) (C', K') if 

F{A®B) = F{A) ®' F{B) 


and F{K) = K'. 



Example 3.3 A monoidal category is a monoid in the monoidal category 
{Cat, x) of categories with Cartesian product. 

Example 3.4 The category Symm with objects [n] for n = 0,1, ... and 
morphisms 


Symm{[n\, [m]) = 


0, if n ^ m, 
ifn = m. 

where is the group of permutations of {1,... ,n). with the multiplication 

* : Symm x Symm —> Symm 

such that [n] * [m] = [n + m — 1] with the folowing identification of the inputs 
(l,...,n) * (T, ...,m) = (l,...,n,2, ...,m) 


which explanes the action of * on morphisms. 

Example 3.5 Let {C, K) and {C, K') be two monoidal categories, F G 

Ob{CC') and F{K) = K'. Then for such functors F on the category there is 
a monoidal structure and a monoid is defined by a functor morphism 

Pa,b : F{A) F{B) ^ F{A ® B) 

with natural axioms associativity and unit. 


EXAMPLES 3.6-3.7 Bialgebras and Dual construction: 

Algebras as monoids in k-bf vect, k-sdg, Bialgebras as comonoids in k- 

alg, /c-bialg. 

Double Categories as monoids in the category of pairs of functors. 


4 Categories of information transformers 

4.1 Common structure of classes of information 
transformers 

It is natural to assume that for any information transformer a there are de- 
hned a couple of spaces: A and B, the space of “inputs” (or input signals) 
and the space of “outputs” (results of measurement, transformation, pro¬ 
cessing, etc.). We will say that a “acts” from A to B and denote this as 



a\ A ^ B. It is important to note that typically an information transformer 
not only transforms signals, but also introduces some “noise”. In this case it 
is nondeterministic and cannot be represented just by a mapping from A to 

B. 

It is natural to study information transformers of similar type by aggre¬ 
gating them into families endowed by a fairly rich algebraic structure [5,11]. 
Specihcally, it is natural to assume that families of ITs poses the following 
properties: 

{a) \i a\ A ^ B and b: B ^ C are two ITs, then their composition 
boa: A^ C is dehned. 

(b) This operation of composition is associative. 

(c) There are certain neutral elements in these families, i.e., ITs that 
do not introduce any alterations. Namely, for any space B there exist a 
corresponding YI i^\ B B such that i^o a = a and b o i^ = b. 

Algebraic structures of this type are called categories [ 6 , 8 ]. 

Furthermore, we will assume, that to every pair of information trans¬ 
formers, acting from the same space D to spaces A and B respectively, there 
corresponds a certain IT a * 6 (called product of a and b) from D to A x B. 
This IT in a certain sense “represents” both ITs a and b simultaneously. 
Specifically, ITs a and b can be “extracted” from a * b hj means of projec¬ 
tions TT^g and g from Ax B to A and B, respectively, i.e., tt^^o (a* 6 ) = a, 
z/^go(a*&) = b. Note, that typically, an IT c such that ^ 40 °^ ~ ~ ^ 

is not unique, i.e., a category of ITs does not have products (in category- 
theoretic sense [6-9]). Thus, the notion of a category of ITs demands for an 
accurate formalization. 

Analysis of classes of information transformers studied in [5, 10-18], gives 
grounds to consider these classes as categories that satisfy certain fairly gen¬ 
eral conditions. 

4.2 Elementary axioms for categories of information 
transformers 

In this subsection we set forward the main properties of categories of ITs. 
All the following study will rely exactly on these properties. 

In [5, 10-18] it is shown (see also examples in section 8 below) that classes 
of information transformers can be considered as morphisms in certain cate¬ 
gories. As a rule, such categories do not have products, which is a peculiar 



expression of nondeterministic nature of ITs in these categories. However, 
it turns out that deterministic information transformers, which are usually 
determined in a natural way in any category of ITs, form a subcategory with 
products. This point makes it possible to dehne a “product” of objects in a 
category of ITs. Moreover, it provides an axiomatic way to describe an ex¬ 
tension of the product operation from the subcategory of deterministic ITs 
to the whole category of ITs. 

Definition 4.1 We shall say that a category C is a category of information 
transformers if the following axioms hold: 

1. There is a fixed subcategory of deterministic ITs D that contains all 
the objects of the category C ('Ob(D) = Ob(C )). 

2. The classes of isomorphisms in D and in C coincide, that is, all the 
isomorphisms in C are deterministic. 

3. The categories D and C have a common terminal object Z. 

4- The category D has pairwise products. 

5. There is a specified extension of morphism product from the subcategory 
D to the whole category C, that is, for any object V and for any pair of 
morphisms a: V ^ A and b: V B in C there is certain information 
transformer a*b: V ^ Ax B (which is also called a product of ITs a 
and b) such that 

(^a * b) = a, (^a * b) = b. 

6. Let a: A ^ C and h: B ^ V are arbitrary ITs in C, then the IT a xb 
defined by Eg. (3) satisfy Eg. (5).' 

(a X 6) o (c * d) = {ao c) * {ho d). 

7. Eguality (7) holds not only in D but in C as well, that is, product of 
information transformers is “commutative up to isomorphism. ” 

8. Eguality (9) also holds in C. In other words, product of information 
transformers is “associative up to isomorphism” too. 



Now let us make several comments concerning the above definition. 

We stress that in the description of the extension of morphism product 
from the category D to C (cf. 5.) we do not require the uniqueness of an IT 
c:V-^A X B that satisfy conditions (1). 

Nevertheless, it is easily verified, that the equations (4) are valid for 
c = a X b not only in the category D, but in C as well, that is, 

^c,v o {a xb) = ao o {a xb) = bo 

However, the IT c that satisfy the equations (4) may be not unique. Note 
also that in the category C Eq. (2) in general does not hold. 

Further, note that the axiom 6 immediately implies 

(a X 6) o (c X d) = (a o c) X (6 o d). 

Finally note that any category that has a terminal object and pairwise 
products can be considered as a category of ITs in which all information 
transformers are deterministic. 


5 Category of information transformers as 
a monoidal category 

As we have already mentioned above in a category of ITs there are certain 
“meaningfnl” operations of prodnct for objects and for morphisms. However, 
these operations are not prodnct operations in category-theoretic sense. Nev¬ 
ertheless, every category of ITs is a monoidal eategory (see, e.g., [6, 8]). 

First note, that every category D with pairwise prodncts and with ter¬ 
minal object Z constitntes a monoidal category {D, x, Z,a, X, p), where 
x: DxD ^ D is the product functor and {Ax B) x C ^ Ax {B x C), 

\^: Z X A ^ A, and p^: A x Z ^ A are the obvious natural equivalences. 
Besides, as a category with prodncts, the category D has a natnral eqniva- 
lence cr, Ax B B x A, which interchanges components in a prodnct. 

Definition 5.1 We will say that a eategory C is a category of information 
transformers over a suheategory (of deterministic ITs) D if the following 
three axioms hold. 



Axiom 1. (C, x,Z,a, A, p) is a monoidal category for a certain: functor 
X : C X C —C, object Z, and natural equivalences a, X and p. 

We will refer to morphisms of the category C as information transformers. 

Axiom 2. The category C has a subcategory D, such that all the objects 
of C are contained in D, 2 is a terminal object in D, and the functor x is 
a product functor on D. 

Morphisms of the subcategory D will be called deterministic information 
transformers. 

Thus, the following properties hold in the subcategory D; 

(a) There are natural transformations dehned in D, A x B A 
and Ax B B that specify projections on components of a product. 

(b) For any deterministic Its (morphisms in D) a: C ^ A and b: C ^ B 
there exists e unique IT c = a * b: C ^ A x B for which tt. o c = a and 

(c) D is also a monoidal category with the natural equivalences a, X and 
p explicitly expressed through tt and z/, i.e., 

def def 

^ Pa ^ ^A,Z^ 

~ ^'^A,B ° '^AxB,c) * {^^A^B ° '^AxB,c) * ^AxB,(^ ' 

(d) There is a natural equivalence of “object transposition” a dehned on 

D; 

^A,B ^ ’^A,B * '^A,B' X\.X B ^ B X A. 

(e) There is a “diagonal” natural transformation <5 dehned on D; 

S^=U^*Y. C^CxC. 

Note that with the help of the “diagonal” natural transformation the 
product of morphisms a*b may be expressed through their “functorial prod¬ 
uct” a X b, i.e., a * b = {a x b) o S^. 

Let us stress here, that we do not require that 5 is a natural transforma¬ 
tion on the whole category C. Furthermore, typically, in many important 
examples of categories of ITs 6 is not a natural transformation. Such cat¬ 
egories do not have products in category-theoretic sense. However we can 
extend the product operation for morphisms from the subcategory D to C. 
Specihcally, we dehne in C; 

a * 6 {a X b) o 5^. 



Axiom 3. Natural transformations tt, v and a (in the category D) are 
natural transformations in the whole category C as well. 

Theorem 1. Definitions 4-i and 5.1 are equivalent. 


6 IT-Category as Kleisli category 

6.1 Concept of distribution. Kleisli category 

The two equivalent dehnitions presented above provide the minimal concep¬ 
tual background for studying categories of ITs, e.g., for dehnition and analysis 
of informativeness, semantic informativeness, decision problems, etc. [1-5, 
10-18]. However these dehnitions do not provide any tools for constructing 
categories of ITs on the basis of more elementary concepts. The concept of 
distribution is one of the most important and it plays a critical role in the 
uniform construction of a wide spectrum of IT-categories. Its importance is 
connected to the observation that in many important IT-categories an infor¬ 
mation transformer a: B may be represented by a morphisms from A to 

the “object of distributions” over B. For example, a probabilistic transition 
distribution (an IT in the category of stochastic ITs) may be represented by 
a certain measurable mapping A to the space of distributions on B. 

Thus, we will suppose that on some hxed “base” category D (category 
of deterministic ITs) there dehned a functor T, which takes an object A 
to the object TA of “distributions” on A. Besides, we assume that there 
are two natural transformations connected to this functor: rj: I ^ T and 
fi: TT T. Informally, rj^: A TA takes an element of A to a “discrete 
distribution, concentrated on this element”, and TTA ^ TA “mixes” 
(averages) a distribution of distributions on A, by transforming it to a certain 
distribution on A. Besides, there are natural “coherence” conditions for rj 
and /i: 

h_4 ° ° Dr A 

and 

t^A ° ^ ° Dr A ^ ^A 



that may be presented by the following commutative diagrams: 

TTTA -- TTA TA -- TTA -- TA 



TTA -^- -TA TA 


Commutativity of the square means that for any “third-order distribu¬ 
tion” on A (i.e. distribution on a collection of distributions on a family of 
distributions on A) the result of “mixing” of distributions does not depend 
on the order of “mixing”. More precisely, the result of mixing over the “top” 
(third order, element of TTTA) distribution hrst and mixing the resulting 
second-order distribution next should give the same result as for mixing over 
“intermediate” (second-order, elements of TTA) distributions hrst and then 
mixing the resulting second order distribution. Commutativity of the left 
triangle means that mixing of a second order distribution, concentrated in 
one element (which is itself a distribution on A) gives this distribution. Fi¬ 
nally, commutativity of the right triangle means if we take some distribution 
on A, transform it to “the same” distribution of singletons and then mix the 
resulting second-order distribution, we will obtain the original distribution. 

It is well known, that a collection (T, t], fi), satisfying the two commutative 
diagrams above, is called a triple (monad) [6, 8, 23] on the category D. 

The concept of triple provides an elegant technique of constructing a 
category of ITs C on the basis of the category of deterministic ITs, as a 
Kleisli category [6, 23]. In this construction each morphisms a\ A B in 
the category C is determined by a morphism a' \ A ^ TB of the category D. 
The composition aofeoflTsa: A ^ B and 6: C in C is represented by 

the morphism 

ip o a)' o Th' o a' 

f 'T'W 

{bo ay = A - ^TB - -TTC—^TC 

in D, and any deterministic IT c: C —^ "D (in C) are determined by the 
morphism 

/ def 

C = Tj^oc 

, ^ c rt 

c' = C - -V - '-^TV 


in D. 



6.2 Independent distribution. 

Monoidal Kleisli category 

The main factor in the construction of the category of ITs as a Kleisli category 
is equipping it with a structure of monoidal category. For this purpose we in¬ 
troduce a natural transformation 7 : xT ^ Tx, TAxTB ^ T{AxB), 
which “takes” a pair of distributions to their “independent joint distribution” 
(see also [35]). Then the product c = a * b of ITs a: V A and b: V ^ B 
(in C) is determined by the morphism 

/ def / / 7 /\ 

^ = ^A,B° A 



in D. Note, that a' * b' here exists and is uniquely dehned since D is a 
category with products. 

Theorem 2. Suppose that D is a category with pairwise products and 
with terminal object Z; tt, v, a, a are the corresponding natural transforma¬ 
tions, and {T, rj, p) is a triple on D with monomorphic for every B. Then 
the generated Kleisli category C, eguipped with a natural transformation 7 , 
is a category of information transformers if and only if the following com¬ 
patibility conditions of p with the natural transformations n, v, a, a, p, and 
p hold: 

77-7 and 1/-7 conditions: 


° '^Ap '^TA,TB ^^AP ° ^ 4,6 ^TA,TB 

cr -7 condition: 

'^^AP ° '^AP ^ %,A ° ^TA,TB 

cx-'y condition: 

^^AP,C ° '^AxBP ° ^'^Ap ^ ^ ^APxC ° ^^TA ^ ° ^A,TB,Te 



^i -7 condition: 


^^AxB ° ° '^TA,TB '^A,B ° ^^^A ^ 

77-7 condition: 

^A,B°(^A^%)=^AxB 

Thus, construction of a categories of ITs is, in effect, reduced to selection 
of a base category D, a functor T; D —^ D, and a natural transformation 
7 : X T —Tx. 

All these conditions have rather transparent meaning that we will try to 
comment below. 

For better understanding we also provide the corresponding commutative 
diagrams in which we omit the obvious indices for the sake of readability: 

77-7 and i/-7 conditions. Marginal distributions extracted from indepen¬ 
dent joint distribution coincide with the original distributions: 

TA 

ttT Ttt 



cr-7 condition. Transposition of components of an independent joint dis¬ 
tribution leads to the corresponding transformation of the joint distribution, 
i.e.. Independent joint distribution is “invariant” with respect to transposi¬ 
tion of its components. More precisely, we can say that the independent dis¬ 
tribution morphism for transposed components TB x TA T{B x M) 
is naturally isomorphic to the original morphism TA x TB ^ T{A x 
B). The corresponding isomorphism (of morphisms) is provided by the pair 

(^'^TA,TB ^ 

TA X TB -^-- T{A X B) 

(jT Ta 


TB X TA 


7 


T{B X A) 



Q :-7 condition: Independent joint distribution for three components is 
“naturally invariant” with respect to the order of parentheses. More precisely, 
the morphisms 


T4 ,hxc ° ^ %-r) ■ TA X {TB X TC) ^ T{A X {B X C)) 


IB,C 


and 


^AxB,c ° ^ hr) ■ {T Ax TB) xTC ^T{{Ax B) xC) 


U,B TC> 

(that take independent joint distributions for three components with different 
order of parentheses) are naturally isomorphic via (^^hATBTC ^ '^'^A t 


.,B,C 


{TA X TB) X TC ^T{A x B) x TC - - --T((M x B) x C) 

aT Ta 


TA X {TB X TC) ►TM x T{B x C) ■ 


7 


-T{Ax {B X C)) 


/x -7 condition. Independent joint distribution for results of mixing of two 
second-order distributions may also be obtained by mixing the corresponding 
second-order independent distributions: 

TTA X TTB ► T{TA x TB) ► TT{A x B) 

fjL X jj, fi 

- -T{AxB) 


TA X TB 


7 


r /-7 condition: Independent joint distribution for two “singleton” dis¬ 
tributions is just the corresponding “singleton” distribution on a product 
space: 

AxB 


T{A X B) 


TA X TB 




7 


Informativeness of 
information transformers 


7.1 Accuracy relation 

In order to define informativeness relation we will need to introduce first the 
following auxiliary notion. 

Definition 7.1 We will say that > is an accuracy relation on an IT-eategory 
C if for any pair of objects A and B in C the set C{A, B) of all ITs from A to 
B is equipped with a partial order > that satisfies the following monotonicity 
conditions: 

a > a', b \> b' a o b > a' o b', 

a > a', b \> b' a *b > a' *b'. 

Thus, the composition and the product are monotonous with respect to 
the partial order >. For a pair of ITs a, 6 G C(A, B) we shall say that a is 
more accurate then b whenever a \> b. 

It obviously follows from the very dehnition of the operation x (3) and 
from the monotonicity conditions that the operation x is monotone as well: 

a [> a', b [> b' a x b > a' x b'. 

It is clear that for any IT-category there exists at least a “trivial variant” 
of the partial order >, namely, one can choose an equality relation for >, 
that is, one can put a \> b a = b. However, many categories of ITs 

(for example, multivalued and fuzzy ITs) provide a “natural” choice of the 
accuracy relation, which is different from the equality relation. 

7.2 Definition of informativeness relation 

Suppose a: V ^ A and b: V ^ B are two information transformers with 
a common source V. Assume that there exists an IT c: A —H such that 
coa = b. Then any information that can be obtained from b can be obtained 
from a as well (by attaching the IT c next to a). Thus, it is natural to 
consider the information transformer a as being more informative than the 
IT b and also more informative than any IT less accurate than b. 

Now we give the formal dehnition of the informativeness relation in the 
category of information transformers. 



Definition 7.2 We shall say that an information transformer a is more 
informative (better) than b if there exists an information transformer c such 
that CO a > b, that is, 


a b 


def 


3c CO a > b. 


It is easily verified that the informativeness relation is a preorder on 
the class of information transformers in C. This preorder ip induces an 
equivalence relation ~ in the following way: 


def 


a ip b & b ip a. 


Obviously, the relation “more informative” extends the relation “more 
accurate,” that is, 

a > h a ip b. 


7.3 Main properties of informativeness 

It can be easily verihed that the informativeness relation ip satishes the 
following natural properties. 

Lemma 1. Consider all information transformers with a fixed source V. 

(a) The identity information transformer i^ is the most informative and 
the terminal information transformer is the least informative: 

Va i^ ip a ip z^. 

(b) Any information transformer a: V ^ B x C is more informative than 
its parts ^ 

(c) The product a*b is more informative than its components 

a * b ip a,b. 

Furthermore, the informativeness relation is compatible with the compo¬ 
sition and the product operations. 

Lemma 2. 

(a) If a ip b, then a o c ip b o c. 

(b) If a ip b and c ip e, then a * c ip b * e. 



7.4 Structure of the family of 

informativeness equivalence classes 

Let a be some information transformer. We shall denote by [a] the equiva¬ 
lence (with respect to informativeness) class of a. We shall also use boldface 
for equivalence classes, that is, a G a is equivalent to a = [a]. 

Theorem 3. Let Z(L)) be the family of informativeness equivalence 
classes for the class of all information transformers with a fixed domain V. 
The family Z(T)) forms a partial ordered Abelian monoid {Z(T)), )p,*,0) with 
the smallest element 0 and the largest element 1 , where 

[a] ^ [6] a ^ 6, [a] * [b] [a*b], 0 [zj, 1 [g. 

Moreover, the following properties hold: 

(a) 0 * a = a, 

(b) l*a=l, 

(c) 0 ^ a ^ 1, 

(d) a*b^a,b, 

(e) (a:^b) & (O^e) ^ a*c;^b*e. 

8 Informativeness and synthesis of 
optimal information transformers 

In this section, we consider an alternative (with respect to the above) ap¬ 
proach to informativeness comparison. This approach is based on treating 
information transformers as data sources for decision-making problems. 

8.1 Decision-making problems in categories of ITs 

Results of observations, obtained on real sources of information (e.g. indi¬ 
rect measurements) are as a rule unsuitable for straightforward interpreta¬ 
tion. Typically it is assumed that observations suitable for interpretation are 
those into a certain object U which in what follows will be called object of 
interpretations or object of decisions. 



By an interpretable information transformer for signals from an object V 
we mean any information transformer a: V U. 

It is usually thought that some interpretable information transformers are 
more suitable for interpretation (of obtained results) than others. Namely, 
on a set C{V,ll) of information 

transformers from V to U, one dehnes some preorder relation 3>, which 
specihes the relative quality of various interpretable information transform¬ 
ers. Typically the relation 3> is predetermined by the specihc formulation 
of a problem of optimal information transformer synthesis (that is, decision¬ 
making problem). 

We shall say that an abstract decision-making problem is determined by 
a triple (P,W, 3>), where V is an object of studied (input) signals, U is 
an object of decisions (or interpretations), and 3> is a preorder on the set 
C{V,U). 

We shall call a preorder monotone if for any a,b E C{V,U) 

a > h ab, 

that is, more accurate IT provides better quality of interpretation. 

For a given information transformer a: V ^ A we shall also say that an 
IT b reduces a to an interpretable information transformer ii b o a: V U, 
that is, if b: A ^ lA. Such an information transformer b will be called a 
decision strategy. 

The set of all interpretable information transformers obtainable on the 
basis of a: V ^ A will be denoted Uq C C{V,U): 

U/= {boa\b: A^U}. 

We shall call a decision strategy r\ A ^ U optimal (for the IT a with 
respect to the problem {T>,U, 3>)) if the IT r o a is a maximal element in Ua 
with respect to 3>. Thus, a decision-making problem for a given informa¬ 
tion transformer a is stated as the problem of constructing optimal decision 
strategies. 

8.2 Semantical informativeness 

The relation ^ induces a preorder relation □ on a class of information trans¬ 
formers operating from V in the following way. 



Assume that a and b are information transformers with the source V, 
that is, a: V ^ A, b: V B. By dehnition, put 

44 W:B^U 3a': A a'oa^b'ob. 

In other words, a □ 6 if for every interpretable information transformer d 
derived from b there exists an interpretable information transformer c derived 
from a such that d, that is, 

a □ 6 Vd G Uf, 3c G Ua d. 

It can easily be checked that the relation □ is a preorder relation. 

It is natural to expect that if one information transformer is more in¬ 
formative than the other, then the former will be better than the latter in 
any context. In other words, for any preorder ^ on the set of interpretable 
information transformers the induced preorder □ is dominated by the infor¬ 
mativeness relation (that is, □ is weaker than )p). The converse is also 
trne. 

Definition 8.1 We shall say that an information transformer a is seman¬ 
tically more informative than b if for any interpretation object U and for any 
preorder ^ (on the set of interpretable information transformers) a □ 6 for 
the induced preorder □. 

The following theorem is in some sense a “completeness” theorem, which 
establishes a relation between “structure” {b can be “derived” from a) and 
“semantics” (a is nniformly better then b in decision-making problems). 

Theorem 4. 

For any information transformers a and b with a common source V, in¬ 
formation transformer a is more informative than b if and only if a is se¬ 
mantically more informative than b. 

Let us remark that the above proof relies heavily on the extreme extent 
of the class of decision problems involved. This makes it possible to select for 
any given pair of ITs a, b an appropriate decision-making problem {V, Ub, 3>fe) 
in which the interpretation object Ub and the preorder depend on the IT 
b. However, in some cases it is possible to point out a concrete (nniversal) 
decision-making problem snch that 


a)^ b 


a □ &. 



Theorem 5. 

Assume that for a given object V there exists an object V such that for 
every information transformer acting from V there exists an eguivalent (with 
respect to informativeness) IT acting from V to V, that is, 

Vi3 yb-.V-^B 3b':V^V b^y. 

def 

Let us choose the decision object U = T> and the preorder defined by 

1 7 

d c > d. 

Then a )p b if and only if a^b. 

Note that in general case an optimal decision strategy (if exists) can be 
nondeterministic. However, in many cases it is sufficient to search optimal 
strategies among deterministic ITs. Indeed, in some categories of informa¬ 
tion transformers the relation of “accuracy” satishes the following condition; 
every IT is dominated by some deterministic IT, that is, for every IT there 
exists a more accurate deterministic IT. 

Proposition 1. 

Assume that {V,U,^) is a monotone decision-making problem in a cat¬ 
egory of ITs C. Assume also that the following condition holds: 

Vc e Ar(C) 3d G Ar(D) d > c. 

Then for any IT a: V ^ IZ and for any decision strategy r-.lZ—^U there 
exists a deterministic strategy r^\ IZ -^U such that o a 3> r o o. 


9 Decision-making problems with 
a prior information 

In this section we formulate in terms of categories of information transformers 
an analogy for the classical problem of optimal decision strategy construc¬ 
tion for decision problems with a prior information (or information a priori). 
We also prove a counterpart of the Bayesian principle from the theory of 
statistical games [24, 36]. Like its statistical prototype it reduces the prob¬ 
lem of constructing an optimal decision strategy to a much simpler problem 
of hnding an optimal decision for a posterior information (or information a 
posteriori). 



First we define in terms of categories of information transformers some 
necessary concepts, namely, concepts of distribution, conditional information 
transformer, decision problem with a prior information, and others. 


9.1 Distributions in categories of ITs 

We shall say that a distribution on an object A (in some fixed category of 
ITs C) is any IT f: Z ^ A, where Z is the terminal object in C. 

The concept of distribution corresponds to the general concept of an 
element of some object in a category, namely, a morphism from the terminal 
object (see, e.g., [9]). 

Any distribution of the form h: Z Ax B will be called a joint distribu¬ 
tion on A and B. The projections ^ and ^ on the components A and B 
respectively, “extract” marginal distributions f and g of the joint distribution 
h, that is, 

f = a 

9 = Z B. 

We say that the components of a joint distribution h: Z ^ A x B are 
independent whenever this joint distribution is completely determined by its 
marginal distributions, that is. 


Let / be an arbitrary distribution on A and let a: .4. ^ be some 
information transformer. Then the distribution g = a o f in some sense 
“contains an information about /.” This concept can be expressed precisely 
of one consider the joint distribution generated by the distribution / and the 
IT a: 


h: Z A X B, h = {i^* a) o f. 


Note, that the marginal distributions for h coincide with / and g, respectively. 
Indeed, 


^A,B°^ = ^ 6 ° (*. 4 * «) ° / = *^° / = /> 


^A,B ° h = {i_^* a) O f = a O f = g. 

Let h be a joint distribution on Ax B. We shall say that a: A ^ B 
is a conditional IT for h with respect to A whenever h is generated by the 
marginal distribution h and the IT a, that is. 


h = {i^*a)o o h. 



Similarly, an IT 6: i3 —^ such that 

h={h*i^o o h 

will be called a conditional IT for h with respect to B. 

9.2 Bayesian decision-making problems 

Suppose that, like in Section 4, there are hxed two objects V and U in some 
category of ITs, namely, the object of signals and the object of decisions, 
respectively. In a decision-making problem with a prior distribution f on V 
one hxes some preorder on the set of joint distributions on V x U for 
which P-marginal distribution coincides with /. 

Informally, any joint distribution h onVxU of this kind can be considered 
as a joint distribution of a studied signal (with the distribution f = h 

on V) and a decision (with the distribution g = h on U). The preorder 
determines how good is the “correlation” between studied signals and 
decisions. 

Formally, an abstract decision problem with a prior information is deter¬ 
mined by a quadruple (T>, W, /, 3>/), where V is an object of studied signals, 
U is an object of decisions (or interpretations), f'■ Z ^ "D is a prior dis¬ 
tribution (or distribution a priori), and 3>/ is a preorder on the set of ITs 
h: Z X U that satisfy the condition h = f. 

Furthermore, suppose that there is a hxed IT a: V TZ (which deter¬ 
mines a measurement; IZ can be called an object of observations). An IT 
r:7Z—^Uis called optimal (for the IT a with respect to 3>/) if the distribu¬ 
tion [i * r o a) o f is a maximal element with respect to 3>/. The set of all 
optimal information transformers is denoted Optj(a o /). 

Theorem 6 (Bayesian principle). 

Let f be a given prior distribution on V, let a: V 7Z be a fixed IT, 
and let b: IZ ^ V be a conditional information transformer for {i * a) o f 
with respect to IZ. Then the set of optimal ITs r: IZ -^U, namely, the set of 
optimal decision strategies for f over ao f coincides with the set of optimal 
decision strategies for bog over g, where g = ao f: 

Optfiao f) = Opt,,„g{g). 

In a wide class of decision problems (e.g., in linear estimation problems) 
an optimal IT r happens to be deterministic and is specihed by the “deter¬ 
ministic part” of the IT b. 



For many categories of information transformers (for example, stochas¬ 
tic, multivalued, and fuzzy ITs [13, 15, 24]) an optimal decision strategy 
r can be constructed “pointwise” according to the following scheme. For 
the given “result of observation” y E TZ consider the conditional (posterior) 
distribution b{y) for / under a hxed g = y, and put 

r{y) = db(y), 

where db(y) is an optimal decision with respect to the posterior distribution 

%)• 

10 Examples of categories of 

information transformers 

In this section we present several examples of different classes of information 
transformers. The major difference between them is the way of representing 
uncertainty. In each case (except the category stochastic linear ITs, which 
cannot be constructed as a Kleisli category, but is a sub category of one) 
we will mention the corresponding: base category D, functor T, and natural 
transformation 7. “Elementary” dehnitions for these categories may be found 
in [14, 21, 22]. 

10.1 Stochastic ITs 

Let D = Meas, the category of measurable spaces and measurable maps, 
is the space of all probability measures on A, (details may be found in 
[37]) and 7 ^^ takes a pair of distributions V, Q to their product V ® Q, a. 
distribution on Ax B. 

The category of stochastic information transformers ST consists of mea¬ 
surable spaces (as objects) and transition probability functions (as mor- 
phisms, that is, information transformers) [3, 4, 37]. Note that a classical sta¬ 
tistical experiment (namely, a parametrized family of probability measures), 
a statistics (namely, a measurable function of a sample of observations), and 
a decision strategy (possibly, nondeterministic) can be represented by appro¬ 
priate transition probability functions. Thus, all the above concepts ht in 
well with this scheme. 



Suppose A = ©^) and jB = ©g) are two measurable spaces. 

A stochastic information transformer a: A ^ B is determined by a real¬ 
valued function (transition probability function [3, 38, 39] Pa{uj,B) of two 
arguments u G B E &b that satisfy the following conditions: 

(a) Given a hxed event B G ©g, the map Pa{-, B) is a measurable function 
on 

(b) Given a hxed elementary event ui G the map Pa{oj, ■) is a probabil¬ 
ity measure on (Gg, ©g). 

For a given stochastic information transformers a\ A ^ B and b: B ^ C 
their composition b o a in the category ST corresponds to the transition 
probability function (see [3,37] 

Pboaioo, C) = fptiu;', C) P^ioo, du') Vo; G VG G ©c- 
Qb 

The subcategory of deterministic ITs is actually a category Meas of mea¬ 
surable spaces and measurable maps. To every measurable map (p\ A B 
there corresponds the transition probability function 




def 



if (p{uj) G B, 
if ^{uj) ^ B] 


Vcn G 'iB G &B 


The category Meas has products, namely, the product of measurable 
spaces A and B in Meas is (a x i3, g, where 


AxB = X Gb, ©^ © &b) , 


TT^ g and Q are the projections from the Gartesian product r2_4 x Gg onto 
its components and Gg respectively, and ©^ © ©e is the product of a- 
algebras ©_4 and ©g. 

For a given pair of ITs a: V ^ A and b: V ^ B with a common source we 
dehne their product a*b: V ^ Ax B so that for every u G flx> the probability 
distribution Pa*b{^^, ■) on Vl x is the product © of the distributions Pa{uj,-) 
and Pb{u,-), that is. 


Pa*b{^, •) — Pa{^, •) © Pb{^^, •) Vcn G flj). 



In other words (see, for example, [38]), [?]), the distribution is completely 
determined by the following condition: 


Pa*b{,^) X -B) — Pa{uj, A) -B) VcU G ffx), VA G ©yl, VP G &B- 

The only obvious choice for the accuracy relation in the category of 
stochastic ITs seems to be the equality relation. 

Now let us demonstrate that the basic concepts of mathematical statistics 
are adequately described in terms of this IT-category. Namely, we shall verify 
that the concepts of distribution, conditional distribution, etc. (introduced 
above in terms of IT-categories), in the category of stochastic ITs lead to the 
corresponding classical concepts. 

Indeed, any probability distribution Q on a given measurable space A = 
©yi) is uniquely determined by the morphism f: Z ^ A from the 
terminal object Z = ({0}, { 0 , {0}}) (a one-point measurable space) such 
that 

Pf{0,A) = Q{A) VAg©^. 

In what follows we shall omit the hrst argument in Pf{0,A) and write just 
Pf{A) instead. 

A statistical experiment is described by a family of probability measures 
Qg on some measurable space B. This family is usually parametrized by 
elements of a certain set Sometimes (especially when statistical problems 
with a prior information are studied) it is additionally assumed that the 
set is equipped by some a-algebra ©^ and that Qg{B) is a measurable 
function of 0 G for all B G &b (and thus, Qg{B) is a transition probability 
function [39]). Therefore, such statistical experiment is determined by the 
stochastic information transformer a: A ^ B, where 

Pa{e, B) = Qg{B) ye g vp g ©h. 

In the case when no a-algebra on the set is specihed, one can put 
©^ = that is, the a-algebra of all the subsets of the set It is 

clear that in this case the function Pa{e, B) = Qg{B) is a measurable function 
of 6^ G for every fixed B & &b and thus (being a transition probability 
function), is described by a stochastic YI a\ A ^ B. 

Note also, that any statistic, being a measurable function, is represented 
by a certain deterministic IT. Decision strategies also correspond to deter¬ 
ministic ITs. At the same time, nondeterministic (mixed) decision strategies 



are adequately represented by stochastic information transformers of general 
kind. 

Now, let / be some fixed distribution on A and let a: ^ ^ be some 
IT. The joint distribution h on A x B, generated by / and a (from the 
IT-categorical point of view, see Section 7) is 

h = {i * a) o f. 

It means that for every set A x B, where A G and B G fig, 

Ph{Ax B) = jPi^a{oJ, Ax B) Pf{duj) 

= fPi{uj,A)Pa{uj,B)Pf{duj) 

= jPa{uJ,B)Pf{du). 

A 

Thus we come to the well known classical expression for the generated joint 
distribution (see, for example, [39]). 

Now assume that P/ is considered as some probability prior distribution 
(or distribution a priori) on A. Then for a given transition probability func¬ 
tion Pa, a posterior (or conditional) distribution Pb{oj', ■) on A for a hxed 
u' G flj 3 is determined, accordingly to [39] by a transition probability func¬ 
tion Phioj', A), oj' & VLb, a G such that 

Ph{AxB) = jPb{u',A)Pg{du') VAg©^, VPG©g, 

B 


where 

Pg{B)= fPa{u,B)Pf{du) VPG©g. 

It is easily verihed that in terms of ITs the above expressions have the fol¬ 
lowing forms: 

h = {b * i) o g, 


where 


9 = a° f- 



This shows, that the classical concept of conditional distribution is ade¬ 
quately described by the concept of conditional IT in terms of categories 
of information transformers. 

10.2 Linear stochastic ITs with additive noise 

As we will see this category of ITs cannot be constructed as a Kleisli category, 
but is a subcategory the category of stochastic ITs, examined above. 

Suppose V and TZ are arbitrary finite-dimensional Euclidean spaces. We 
shall say that a linear information transformer [11, 12] (measurement model 
[40]) [?]) a acting from V to TZ 


a: V TZ, 

is determined by a pair 

^a) ) Zta ■ T) ^ TZ, : TZ ^ TZ, ^ 0, 

where Aa and are linear maps. 

Such pair {Aa, represents a statistical experiment of the form [40] 
y = AaX + u, X eT), y eTZ, 

where z/ is a random vector in TZ with the zero mean and the correlation 
operator 

The composition ot two linear ITs {Aa, : T) A and {Ab, T^b) : A B 
is dehned by 


{Ab, Sfe) O {Aa, S,) = {AbAa, Sfe + AbTlaAl ). 

The composition corresponds to the consecutive connection of information 
transformers that have independent random errors. 

The product of two information transformers 

-.V^A, {Ab,Eb) :V^B 


is dehned by: 



where 


-^a^h ' ^ ^ ^ ^ 1 i 

Sa*6 : ^ X i3 ^ ^ X i3, Sa*6 (x, y) =^ (SaX, Sfe|/) . 

This construction gives us the category SLT with the subcategory of 
deterministic ITs is (isomorphic to) the category of Euclidean spaces and 
linear maps. In this case a linear map A: V TZ corresponds to the IT 

(AO) :V-^n. 

As we have already mentioned this category of ITs cannot be constructed 
as a Kleisli category over the category of hnite dimensional Euclidean spaces. 
Indeed we can not dehne a “space of distributions” on some space A as a hnite 
dimensional linear space, and thus, can not dehne functor T in the category 
of hnite dimensional Euclidean spaces. However, SLT may be considered as 
a subcategory the category of stochastic ITs ST, examined above. Indeed, 
each Euclidean space may be considered as a measurable space endowed 
with Borel a-algebra. Finally, we may consider an IT a = (A^, : V TZ 

(in SLT) as the transition probability, that takes an element x E V to the 
normal distribution N{AaX, S^) with the mean value AaX and the correlation 
operator Routine verihcation shows, that the composition and product 
operations are preserved under such inclusion of SLT into ST. 

In addition to the trivial relation of accuracy (which coincides with the 
equality relation) one can dehne the accuracy relation in the following way: 

(Aa, Sa) > (Ab, Sb) Aa = Ab, Sa ^ ^b. 

However, it can be proved that the informativeness relations corresponding 
these diherent accuracy preorders, actually coincide. 

In the category of linear information transformers every equivalence class 
[a] corresponds to a pair {Q, S), where Q C "D is an Euclidean subspace and 
S: Q —Q is nonnegative dehnite operator, that is. S' ^ 0. In these terms 

(QlAl) ^ (Q 2 A 2 ) 44 QiDQ 2 , Si\Q 2 ^S 2 . 

Here Si \ Q 2 (the restriction of Si on Q 2 ) is dehned by the expression 

Si \ Q2 *= P2I1S1P1I21 where IjQj ^ V is the subspace inclusion, and 
Pj-.V-^ Qj is the orthogonal projection (cf. [11, 40]). 

Note also that in the category of linear information transformers every IT 
is dominated (in the sense of the preorder relation >) by a deterministic IT. 



Hence, according to Proposition 2, in any monotone decision-making problem 
without loss of quality one can search optimal decision 
strategies in the class of deterministic ITs. 

It is shown in [12], that in the category of linear ITs for any joint distri¬ 
bution there always exist conditional distributions. Thus in problems with 
a prior information one can apply Bayesian principle. Its direct proof in 
the category of linear ITs as well as the explicit expression for conditional 
information transformers can be found in [12]. 

10.3 The category of sets as a category of ITs 

As a trivial example of IT-category we consider the category of sets Set, 
whose objects are sets and morphisms are maps. This category has products, 
hence all the ITs are deterministic. In fact this category is trivially a Kleisli 
category with identity functor as functor T. 

It is not hard to prove that for a given set V, the class of equivalent 
informativeness for an IT a with the set V being its domain, is completely 
determined by the following equivalence relation on V: 

def w ^ 

X y cix = ay vx, y ^ U. 

Furthermore, a 6 if and only if the equivalence relation is finer than 
~fe, that is. 


a)p b Vx, y ^ V [x y x y) ■ 

Thus, the partially ordered monoid of equivalence classes for ITs with 
the source V, is isomorphic to the monoid of all equivalence relations on V 
equipped with the order “hner” and with the product: 

X (~a * y {x ~a y, X ^b y) Vx, y ev. 

10.4 Multivalued ITs 

Let D = Set, the category of sets, is the set of all nonempty subsets of A 
and 7 ^ ^ takes pair of sets "P, Q to their Cartesian product P x Q, a subset of 
A X B. This leads us to the category MVT of multivalued ITs. Detailed study 
of this category may be found in [14]. Thus, the category MVT consists of 
sets as objects and of multivalued maps (everywhere dehned relations) as 



morphisms (information transformers). Despite its simplicity, this class of 
ITs may be convenient when stochastic description of measurement error is 
inadequate. 

So, a multivalued IT a from V to TZ 

a: V ^ TZ 

is determined by a multivalued map, that is, 

\lx eV ax C7Z, ax ^ 0. 

Define the composition and the product of multivalued ITs by the following 
expressions: 

{b o a){x) [by \ y E ax], 

{a * b) {x) ax x bx. 

The subcategory of deterministic ITs is actually the category of sets Set. 

In addition to the trivial accuracy relation in the category of multivalued 
ITs one can put 

a > b tfx E V ax C bx. 

These two accuracy relations lead to different informativeness relations 
[14], called (strong) informativeness and weak informativeness 

For the both informativeness relations the classes of equivalent ITs with 
a fixed source V can be described explicitly. 

In the case of weak informativeness every class of equivalent ITs corre¬ 
sponds to a certain covering V of the set V, such that if V contains some set 
B then it contains all its subsets: 

(3S eV{ACB)) ^ AeV. 

Moreover, a covering Vi is more (weakly) informative than V 2 (namely, 
Vi corresponds to a class of more (weakly) informative ITs than V 2 ) if Vi is 
contained in V 2 , that is, 

Vi'^V2 44 Pi C P 2 . 

In the case of (strong) informativeness every class of equivalent ITs corre¬ 
sponds to a covering V of the set P, that satisfy the more complex condition: 

(^{^BeVACB) & {3B0V A = Ui3)) ^ AeV. 



In this case 

Vi)^V2 44 (VA e Pi 3P G P 2 A (ZB) 

& (VPgPs 3^CPi P = U^)). 

In the category of mnltivalued information transformers every IT is dom¬ 
inated (in the sense of the partial order >) by a deterministic IT. Thus, 
in the monotone decision-making problem one can search optimal decision 
strategies in the class of deterministic ones. 

For every joint distribution in the category of multivalued ITs there exist 
conditional distributions [13]. Therefore, in decision problems with a prior 
information, the Bayesian approach can be effectively applied. 

10.5 Categories of fuzzy information transformers 

Here we dehne two categories of fuzzy information transformers FMT and 
FPT that correspond to different fuzzy theories [15] Let D = Set, TA is the 
set of all normalized fuzzy subsets of A and 7 ^ ^ takes pair of fuzzy subsets 
P, Q to the fuzzy subset P x Q, (P x Q){x,y) = V{x) ® Q(|/),where the 
operation ® may be dehned in a variety of ways. The most common are the 
minimum (the category FMT) and product (the category FPT) operations 
|15], 

Objects of these categories are arbitrary sets and morphisms are every¬ 
where dehned fuzzy maps, namely, maps that take an element to a normed 
fuzzy set (a fuzzy set A is normed if supremum of its membership function 

is 1). Thus, an information transformer a\ A ^ B is dehned by a mem¬ 
bership function which is interpreted as the grade of membership of 

an element y E B to a fuzzy set ax for every element x E A. 

The category FMT. Suppose a: A ^ B and b: B ^ C are some fuzzy 
maps. We dehne their composition & o a as follows: for every element x E A 
put 



For a pairs of fuzzy information transformers a\ T> A and b\ T> ^ B 
with the common source P, we dehne their product as the IT that acts from 
V to the Cartesian product A x B, such that 



The category FPT. Define the composition and the product by the fol¬ 
lowing expressions: 


/i 




In the both defined above categories of fuzzy information transformers 
the subcategory of deterministic ITs is (isomorphic to) the category of sets 
Set. Let g: A-^ B he some map (morphism in Set). Define the correspond¬ 
ing fuzzy IT (namely, a fuzzy map, which is obviously, everywhere defined) 
g: A B in the following way: 




1, if g{x) = y, 
0, if g{x) ^ y. 


Concerning the choice of accuracy relation, note, that in these IT-cate- 
gories, like in the category of multivalued ITs, apart from the trivial accuracy 
relation one can put for a, 6: A ^ B 

at>b ^ \/x eA MyeB y^J^y) ^ p^^(|/). 

In each fuzzy IT-category these two choices lead to two different infor¬ 
mativeness relations, namely the strong and the weak ones. 

Like in the categories of linear and multivalued ITs discussed above, 
monotone decision-making problems admit restriction of the class of opti¬ 
mal decision strategies to deterministic ITs without loss of quality. 

It was shown in [15] that for every joint distribution in the categories of 
fuzzy ITs there exist conditional distributions. It allows Bayesian approach 
and makes use of Bayesian principle in decision problems with a prior infor¬ 
mation for fuzzy ITs [15] (see also [16-18] where connections between fuzzy 
decision problems and the underlying fuzzy logic are studied). 

In this section we introduced only several examples of IT-categories. Let 
us also remark that there is an extensive literature that studies a wide spec¬ 
trum of categories which are close in their structure to IT-categories [35, 
41-46]. 



References 


[1] Sacksteder R. A Note on Statistical Equivalence // Ann. Math. Stat. 
1967. 38 , No. 3. P. 787-795. 

[2] Morse N., Sacksteder R. Statistical isomorphism // Ann. Math. Stat. 
1966. 37 No. 2. P. 203-214. 

[3] Chentsov N. N. Statistical Decision Rules and Optimal Inference [in 
Russian]. Nauka, Moscow, 1972. 

[4] Chentsov N. N. Categories of mathematical statistics // Dokl. Akad. 
Nauk SSSR, 1965. 164 , No.3. P. 511-514. 

[5] Golubtsov P. V. Measurement Systems: Algebraic Properties and Infor- 
mativity // Pattern Recognition and Image Analysis, 1991. 1 No. 1. P. 
77-86. 

[6] MacLane S. Categories for the working mathematician. New York: 
Springer, 1971. 

[7] Herrlich H., Strecker G. E. Category theory. Boston: Allyn and Bacon, 
1973. 

[8] Arbih M. A., Manes E. G. Arrows, structures and functors. New York: 
Academic Press, 1975. 

[9] Goldblatt R. Topoi. The Categorial Analysis of Logic. North-Holland, 
Amsterdam, 1979. 

[10] Moskaliuk S. S. The method of categorical extension of the theory of 
Cayley-Klein groups (to appear). 

[11] Golubtsov P. V. Informativity in the category of linear measurement 
systems // Probl. Inf. Transm. 1992. 28 , No. 2. P. 125-140. 

[12] Golubtsov P. V. Relative informativity and a priori information in the 
category of linear information transformers // Probl. Inf. Transm. 1995. 
31 , No. 3. P. 195-215. 

[13] Golubtsov P. V., Eilatova S. A. Multivalued measurement-computer sys¬ 
tems // Mat. Model. 1992. 4 , No. 7. P. 79-94. 



[14] Golubtsov P. V. Informalivity in the category multivalued information 
transformers 

Probl. Inf. Transm. 1998. 34 , No. 3. P. 259-276. 

[15] Golubtsov P. V. Theory of fuzzy sets as a theory of uncertainty and 
decision-making problems in fuzzy experiments // Probl. Inf. Transm. 
1994. 30 , No. 3. P. 232-250. 

[16] Golubtsov P. V. Fuzzy Logical Semantics of Bayesian Decision Making 
// SPIE Proceedings. V. 2493. Applications of Fuzzy Logic Technology 
11. Orlando, Florida, 1995. P. 228-239. 

[17] Golubtsov P. V. Categories of Information Transformers and the Con¬ 
cept of Informativity // Proc. Int. Conf. on Informatics and Control 
(ICI&C’97). St.-Petersburg, Russia, 1997. V. 2. P. 512-517. 

[18] Golubtsov P.V., Moskaliuk S. S. Bayesian decisions and fuzzy logic. ~ 
Vienna, 1998.- 8 p. (Preprint/ The Erwin Schrodinger International In¬ 
stitute for Mathematical Physics, ESI No. 626). 

[19] Moskaliuk S.S., Vlassov A.T. On some categorical constructions in 
mathematical physics // Proc. of the 5th Wigner Symposium.- Singa- 
pure: World Scientihc.- 1998. P. 162-164. 

[20] Moskaliuk S.S., Vlassov A. T. Double categories in mathematical physics 
// Ukr. Journ. Phys.- 1998. 43 , No. 7. P. 162-164. 

[21] Golubtsov P. V. Axiomatic description of categories of information 
transformers // Probl. Inf. Transm. 1999. 35, No.3. P. 80-99. 

[22] Moskaliuk S.S. The method of additional structures on the objects of 
a category as a background for category analysis in physics // Ukr. J. 
Phys. 2002. 46 , No. 3. P. 51-58. 

[23] Barr M., Wells G. Toposes, Triples and Theories. New York: Springer, 
1984. 

[24] Borovkov A. A. Mathematical Statistics. Supplementary Chapters [in 
Russian]. Nauka, Moscow, 1984. 



[25] Blackwell D. Comparison of Experiments // Proc. Second Berkeley Sym- 
pos. on Mathematical Statistics and Probability. University of California 
Press, 1951.P. 93-102. 

[26] Blackwell D. Equivalent Comparison of Experiments // Ann. Math. 
Stat. 1953. 24 , No. 2. P. 265-272. 

[27] Crane L., Frenkel LB. Four-dimensional topological quantum held the¬ 
ory, Hopf categories and the canonical bases // J. Math. Phys. 1994. 
35. P. 5136-5154. 

[28] Neuchl M. Representation theory of Hopf categories, Ph.D. thesis. 

[29] Lusztig G. Quivers, perverse shaves and quantized enveloping algebras 
// J. American Math. Soc. 1991. 4 , No. 2. P. 365-421. 

[30] Lybashenko V. Operations and isomorphisms in a triangulated Hopf cat¬ 
egory // Methods of Func. Analysis and Topology. 1999. 5, No. 4. P. 
37-53. 

[31] bernstein J., Lunts V. Equivariant sheaves and functors. Lecture Notes 
in Math. V. 1578. Springer: Berlin-Heidelberg, 1994. 

[32] Baez J.C., Dolan J. Higher dimensional algebra and topological quan¬ 
tum held theory // J. Math. Phys. 1995. 36. P. 6073-6105. 

[33] Street R. The role of Michael Batanin’s monoidal globular categories. 
Notes of lectures at the conference on higher category theory and math¬ 
ematical physics. Northwesten University: Evanston, Illinois, 1997. 

[34] Tamsamani Z. Sur de notion de n-categorie et n-groupoide non-stricte 
via des ensembles multi-simpliciaux. Thesis, Universite Paul Sabatier, 
Toulouse 1996 (available agl-geom 95-12 and 96-07). 

[35] Manes E. G. A class of fuzzy theories // J. Math. Anal, and Appl. 1982. 
85. P. 409-451. 


[36] Blackwell D., Girschick M. A. Theory of games and statistical decisions. 
Wiley Sons: New York, 1954. 



[37] Giry M. A categorical approach to probability theory // Categorical 
Aspects of Topology and Analysis. Lecture 

Notes in Mathematics. No. 915. Berlin: Springer-Verlag. 1982. P. 68-85. 

[38] Neveu J. Bases mathematiques du calculus des probabilites. Masson et 
Cie: Paris, 1964. 

[39] Barra J.-R. Notions fondamentales de statistique mathematique. 
Dunod: Paris, 1974. 

[40] Pyt’ev Yu. P. Reduction problems in experimental research // Mat. Sb. 
1983 120 , No. 2. P. 240-272. 

[41] Goguen J. A. L-fuzzy sets // J. Math. Anal, and Appl. 1967. 18, P. 
145-174. 

[42] Goguen J. A. Categories of V-sets // Bull. Amer. Math. Soc. 1969. 75. 
P. 622-624. 

[43] Goguen J. A. Concept representation in natural and artihcial languages: 
axioms, extensions, and applications for fuzzy sets // Int. J. Man- 
Machine Studies. 1974. 6. P. 513-561. 

[44] Arbib M. A., Manes E. G. Fuzzy machines in a category // Bull. Austral. 
Math. Soc. 1975. 13 , P. 169-210. 

[45] Arbib M. A., Manes E. G. Fuzzy morphisms in automata theory // 
Category Theory Applied to Computation and Control. Lecture Notes 
in Computer Science. No. 25. New York: Springer-Verlag, 1976. P. 80- 
86 . 


[46] Manes E. G. Algebraic theories. Springer: New York, 1976. 



